Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation

نویسندگان

چکیده

This paper studies the robustness of reinforcement learning algorithms to errors in process. Specifically, we revisit benchmark problem discrete-time linear quadratic regulation (LQR) and study long-standing open question: Under what conditions is policy iteration method robustly stable from a dynamical systems perspective? Using advanced stability results control theory, it shown that for LQR inherently robust small process enjoys small-disturbance input-to-state stability: whenever error each bounded small, solutions algorithm are also bounded, and, moreover, enter stay neighborhood optimal solution. As an application, novel off-policy optimistic least-squares proposed, when system dynamics subjected additive stochastic disturbances. The proposed new validated by numerical example.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning Applied to Linear Quadratic Regulation

Recent research on reinforcement learning has focused on algorithms based on the principles of Dynamic Programming (DP). One of the most promising areas of application for these algorithms is the control of dynamical systems, and some impressive results have been achieved. However, there are significant gaps between practice and theory. In particular, there are no con vergence proofs for proble...

متن کامل

Linear Quadratic Regulation using Reinforcement

In this paper we describe a possible way to make reinforcement learning more applicable in the context of industrial manufacturing processes. We achieve this by formulating the optimization task in the linear quadratic regulation framework, for which a conventional control theoretic solution exist. By rewriting the Q-learning approach into a linear least squares approximation problem, we can ma...

متن کامل

Robust Reinforcement Learning Decoupling Control Based on Integral Quadratic Constraints

Abstract. In order to keep stable in reinforcement learning process, a novel robust reinforcement learning decoupling control (RRLDC) based on integral quadratic constraints(IQC)is presented in this paper. It composes of a linear model to approximate the nonlinear plant, a state feedback K controller to generate the basic control law, and an adaptive critic unit to evaluate decoupling performan...

متن کامل

Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems

We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, for the average cost LQ problem, a regret bound of O( √ T ) was shown, apart form logarithmic factors. However, this bound scales exponentially with p, the dimension of the ...

متن کامل

Reinforcement Learning in Neural Networks: A Survey

In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i10.17122